1 /9 



FIG. 1 



101 



MEMORY 

i p : 



i SPEECH 
i SAMPLES 
iPOR 

! TRAINING 



105a 



105 



102 



CPU 



i: 



103 



KEYBOARD/ 
DISPLAY UNIT 



i NOISE 
: SAMPLES 
iNOi~NOM 



106 




104 



i UNTRAINED : 
i ACOUSTIC i 
■MODELS ' 



2/9 



FIG. 2 



106a 



PLURALITY OF NOISE SAMPLES 



N01 / N02 / N03 / N04 / N05 



NOm ^ 







.S21 




EXECUTE SPEECH ANALYSIS OF NOISE 
SAMPLES N0i~N0m 






/S22 



COMPUTE TIME-AVERAGE VECTOR 







CATEGORIZE M TIME-AVERAGE 
VECTORS INTO N CLUSTERS 






SELECT ONE NOISE SAMPLE IN EACH 
CLUSTER TO STORE SELECTED NOISE 
SAMPLES 




—jl 







S23 



.S24 



PLURALITY OF NOISE SAMPLES FOR TRAINING 



NL1 / NL2 / NL3 



NLn 



3/9 



FIG. 3 

( STEP S23 ) 



SET M TIME-AVERAGE VECTORS TO M 
CLUSTERS 



.S23A 



COMPUTE EACH DISTANCE BETWEEN 
EACH CLUSTER 



S23B 



EXTRACT AT LEAST ONE PAIR OF TWO 
CLUSTERS WHOSE DISTANCE IS THE 
SHORTEST IN OTHER PAIRED TWO 
CLUSTERS 



S23C 



LINK TWO EXTRACTED CLUSTERS TO 
SET LINKED CLUSTERS TO SAME 
CLUSTER 



.S23D 



/S23E 



IS NUMBER OF 
CLUSTERS ONE ? 



I YES 



PRODUCE DENDROGRAM 



S23F 



CUT DENDROGRAM ON BROKEN LINE 
C-C TO CATEGORIZE NOISE SAMPLES 
INTO N CLUSTERS 



.S23G 



(STEP S24) 



4/9 




5/9 



FIG. 



N01 RIVER 

NO2 MUSIC 

NO3 MARK II 

NO4 COROLLA 

NO5 ESTIMA 

N06 MAJESTA 

NO7 PORTOPIA HALL 

N08 DATA SHOW HALL 

NO9 SUBWAY 

NO10 DEPARTMENT 

NO11 BUSINESS OFFICE 

NO12 LABORATORY 

NO13 BUZZ-BUZZ 

N0i4 OFFICE 

NO15 STREET FACTORY 

NO16 KINDERGARTEN 

N017 TOKYO STATION 



6/9 



FIG. 6 

( START ) 



EXTRACT ONE OF NOISE 
SAMPLES NLi—NLn ? 



S30 



105 



107 



PLURALITY OF NOISE 
SAMPLES FOR^TRAINING 



SUPERIMPOSE EXTRACTED 
NOISE SAMPLE ON SPEECH 
SAMPLES 




EXECUTE SPEECH ANALYSIS 
OF NOISE SUPERIMPOSED 
SPEECH SAMPLE DATA TO 
OBTAIN CHARACTERISTIC 
PARAMETERS 



S32 



TRAIN UNTRAINED ACOUSTIC 
MODEL USING CHARACTERISTIC 
PARAMETERS 



S33 




S35 



YES 



TRAINED 

ACOUSTIC 

MODELS 



-108a 



'110 



7/9 



STANDARD PATTERN /SA/. 



FIG. 7 

/SA/ 



V 



NOISE SUPERIMPOSED ^ , , ^ . . 
SPEECH SAMPLE DATA /A/ / \ /SAA;/HI/ 



NOISE SUPERIMPOSED 

SPEECH SAMPLE DATA /BI/ - /SA/ 



/I/ 



F/G. S 



151 



MEMORY 

! PI ! 



152 



CPU 



1 



INPUTTING 
UNIT 



155 



DICTIONARY 
DATABASE 



150 

153 



KEYBOARD/ 
DISPLAY UNIT 



154 



TRAINED 

ACOUSTIC 

MODELS 



7^ 
156 



157 



110 



8/9 



FIG. 9 



INPUTTED UNKNOWN 
SPEECH SIGNAL 



/S61 



EXECUTE SPEECH ANALYSIS 
OF INPUTTED SPEECH SIGNAL 
BY FRAME 



S62 



PERFORM DP MATCHING IN 
ACCORDANCE WITH 
SYLLABLES 



"P^S63 
RECOGNIZED RESULT 



DICTIONARY 
DATABASE 



'156 



ACOUSTIC 
MODELS 



157 



110 



FIG. 10 PRIOR ART 



201 



SPEECH 

SAMPLES 

FOR 

TRAINING 



205a 



205 



202 



MEMORY 




CPU 





203 



KEYBOARD/ 
DISPLAY UNIT 



204 




ACOUSTIC 
MODELS 



207a 207 



9/9 



FIG. 11 PRIOR ART 



( START ) 



/S81 



SUPERIMPOSE NOISE SAMPLE 
ON SPEECH SAMPLES 



S82 



PERFORM SPEECH ANALYSIS 







TRAIN UNTRAINED ACOUSTIC 


MODEL 





205a 



206a 




NOISE SAMPLES 
FOR TRAINING 
206 



TRAINED 

ACOUSTIC 

MODELS 



^ T 

207a 210 



